Minimax Estimation of a Multivariate Normal Mean
under a Convex Loss Function

by

Pi-Erh Lin* and Amany Mousa
Florida State University

Summary 

Let $X = (X_1, \ldots, X_p)' \sim N_p(\mu, \Sigma)$, where $\mu = (\mu_1, \ldots, \mu_p)'$ and $\Sigma = \mathrm{diag}(\sigma_1^2, \ldots, \sigma_p^2)$ are both unknown and $p \ge 3$. Let $(n_i-2)w_i/\sigma_i^2 \sim \chi^2_{n_i}$, independent of $w_j$ ($i \ne j = 1, \ldots, p$). Assume that $(w_1, \ldots, w_p)$ and $X$ are independent. Define $W = \mathrm{diag}(w_1, \ldots, w_p)$ and $\|X\|_W^2 = X'W^{-1}Q^{-1}W^{-1}X$, where $Q = \mathrm{diag}(q_1, \ldots, q_p)$, $q_i > 0$, $i = 1, \ldots, p$. In this paper, the minimax estimator of Berger and Bock (Ann. Statist. 4 (1976), 642-648), given by $\delta(X, W) = [I_p - r(X, W)\|X\|_W^{-2}Q^{-1}W^{-1}]X$, is shown to be minimax relative to the convex loss $(\delta-\mu)'[\alpha Q + (1-\alpha)\Sigma^{-1}](\delta-\mu)/C$, where $C = \alpha\,\mathrm{tr}(\Sigma Q) + (1-\alpha)p$ and $0 \le \alpha \le 1$, under certain conditions on $r(X, W)$. This generalizes the above-mentioned result of Berger and Bock.
1. Introduction. Let $X = (X_1, \ldots, X_p)'$ be a $p$-variate ($p \ge 3$) random vector normally distributed with mean $\mu = (\mu_1, \ldots, \mu_p)'$ and covariance matrix $\Sigma = \mathrm{diag}(\sigma_1^2, \ldots, \sigma_p^2)$, where $\sigma_i^2$, $i = 1, \ldots, p$, are unknown. This paper obtains a class of minimax estimators for $\mu$ when the loss incurred in estimating $\mu$ by $\delta$ is given by

(1.1)  $L(\delta; \mu, \Sigma) = (\delta-\mu)'[\alpha Q + (1-\alpha)\Sigma^{-1}](\delta-\mu)/C,$

where $0 \le \alpha \le 1$, $C = \alpha\,\mathrm{tr}(Q\Sigma) + (1-\alpha)p$, and $Q$ is a known $p \times p$ diagonal matrix with diagonal elements $q_i > 0$, $i = 1, \ldots, p$. The loss (1.1) is a convex combination of two commonly used loss functions, namely

(1.2)  $L_1(\delta; \mu, \Sigma) = (\delta-\mu)'Q(\delta-\mu)/\mathrm{tr}(Q\Sigma)$

and

(1.3)  $L_2(\delta; \mu, \Sigma) = (\delta-\mu)'\Sigma^{-1}(\delta-\mu)/p.$

The loss function $L_1(\delta; \mu, \Sigma)$ may be used when the relative importance of the parameters to be estimated is reflected by a known set of weights represented by $Q$, while the loss $L_2(\delta; \mu, \Sigma)$ represents the case when the relative importance is reflected naturally by the inverse of the covariance matrix of the variables. In the literature, statisticians tend to use either $L_1(\delta; \mu, \Sigma)$ or $L_2(\delta; \mu, \Sigma)$ as a loss function. In the convex loss (1.1) we have combined both viewpoints in such a way that the nearer $\alpha$ is to 1, the more the loss is assessed by $L_1(\delta; \mu, \Sigma)$, and vice versa.
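As a numerical illustration of how (1.1) combines (1.2) and (1.3), the following Python sketch evaluates the three losses for diagonal $\Sigma$ and $Q$ and exposes the implied mixing weight $\lambda = \alpha\,\mathrm{tr}(Q\Sigma)/C$. The function names and the list-based representation of the diagonals are assumptions made for this illustration; they do not come from the paper.

```python
def convex_loss(delta, mu, sigma2, q, alpha):
    """Loss (1.1) for Sigma = diag(sigma2) and Q = diag(q)."""
    p = len(mu)
    c = alpha * sum(qi * s for qi, s in zip(q, sigma2)) + (1.0 - alpha) * p
    quad = sum((d - m) ** 2 * (alpha * qi + (1.0 - alpha) / s)
               for d, m, s, qi in zip(delta, mu, sigma2, q))
    return quad / c


def loss_l1(delta, mu, sigma2, q):
    """Loss (1.2): Q-weighted quadratic loss scaled by tr(Q Sigma)."""
    tr_q_sigma = sum(qi * s for qi, s in zip(q, sigma2))
    return sum(qi * (d - m) ** 2 for d, m, qi in zip(delta, mu, q)) / tr_q_sigma


def loss_l2(delta, mu, sigma2):
    """Loss (1.3): quadratic loss weighted by Sigma^{-1}, scaled by p."""
    return sum((d - m) ** 2 / s for d, m, s in zip(delta, mu, sigma2)) / len(mu)


def mixing_weight(sigma2, q, alpha):
    """Weight lambda on L1 when (1.1) is written as lambda*L1 + (1-lambda)*L2."""
    tr_q_sigma = sum(qi * s for qi, s in zip(q, sigma2))
    return alpha * tr_q_sigma / (alpha * tr_q_sigma + (1.0 - alpha) * len(q))
```

With this weight, `convex_loss` agrees with `lam * loss_l1(...) + (1 - lam) * loss_l2(...)`, so the weight on $L_1$ equals $\alpha$ only when $\mathrm{tr}(Q\Sigma) = p$.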

It is clear that the maximum likelihood estimator, $X$, is minimax with risk $R(X, \mu) = E_X L(X; \mu, \Sigma) = 1$. Thus an estimator $\delta$ will be minimax under loss (1.1) if and only if $R(X, \mu) - R(\delta, \mu) \ge 0$ for all $\mu$ and $\Sigma$. In evaluating the difference in risk it is necessary to impose certain conditions on $\Sigma$. Specifically, the values or bounds of the trace of $\Sigma^{-1}Q^{-1}$, denoted by $\mathrm{tr}(\Sigma^{-1}Q^{-1})$, and the minimum characteristic root of $Q\Sigma$, denoted by $\mathrm{ch}_{\min}(Q\Sigma)$, will be assumed. Similar conditions have been noted by various authors. For example, Gleser (1976) shows that if a lower bound for $\mathrm{ch}_{\min}(Q\Sigma)$ is known then a family of minimax estimators for $\mu$ can be obtained under the loss function $L_1(\delta; \mu, \Sigma)$; otherwise no estimator of the form $[I_p - h(X'W^{-1}X)Q^{-1}W^{-1}]X$ can be minimax for $\mu$ unless $h(u) = 0$ for almost all $u \ge 0$, where $W \sim W(n, \Sigma)$ is independent of $X$ and $h: R \to R$ is a real function satisfying certain conditions. In our case, where both $Q$ and $\Sigma$ are diagonal matrices, a sufficient set of conditions on $\Sigma$ would be

(1.4)  $\max_{1 \le i \le p}(\sigma_i^2) \le 1/c_1$ and $\min_{1 \le i \le p}(\sigma_i^2) \ge 1/c_2$

for some positive constants $c_1$ and $c_2$ ($0 < c_1 \le c_2 < \infty$). This set of conditions on $\Sigma$ is not unreasonable in application.
2. Useful Lemmas. In establishing the minimaxity of an estimator of the mean of a multivariate normal distribution the following lemmas are very useful. They are presented here for ease of reference and without proof.

Lemma 2.1. [Stein (1974)]. Let $Y \sim N(0, 1)$ and let $g$ be an absolutely continuous function, $g: R \to R$. Then

$E_Y[g'(Y)] = E_Y[Y g(Y)]$

provided that all expectations exist and are finite.
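The identity in Lemma 2.1 can be checked numerically for a particular $g$. The sketch below uses $g(y) = y^3 + \sin y$ and a plain trapezoidal rule on a truncated range; both the test function and the quadrature settings are choices made for this illustration only.

```python
import math


def normal_expectation(f, half_width=12.0, steps=24000):
    """Approximate E[f(Y)], Y ~ N(0, 1), by the trapezoidal rule on [-half_width, half_width]."""
    h = 2.0 * half_width / steps
    total = 0.0
    for k in range(steps + 1):
        y = -half_width + k * h
        weight = 0.5 if k in (0, steps) else 1.0  # end points get half weight
        total += weight * f(y) * math.exp(-0.5 * y * y)
    return total * h / math.sqrt(2.0 * math.pi)


def g(y):
    return y ** 3 + math.sin(y)


def g_prime(y):
    return 3.0 * y ** 2 + math.cos(y)


lhs = normal_expectation(g_prime)                 # E[g'(Y)]
rhs = normal_expectation(lambda y: y * g(y))      # E[Y g(Y)]
```

For this $g$ both sides equal $3 + e^{-1/2}$, since $E[Y^2] = 1$ and $E[\cos Y] = e^{-1/2}$.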

Lemma 2.2. [Efron and Morris (1976)]. Let $U \sim \chi^2_n$ and let $g$ be as defined in Lemma 2.1. Then

$E_U[U g(U)] = n E_U[g(U)] + 2E_U[U g'(U)]$

provided that all expectations exist and are finite.

Corollary 2.2.1. Let $U$ and $g$ be as defined in Lemma 2.2. Let $Z = cU/(n-2)$, $c > 0$, and $h(Z) = g[(n-2)Z/c]$. Then

$E_Z[(n-2)Z h(Z)/c] = n E_Z h(Z) + 2E_Z[Z h'(Z)]$

provided that all expectations exist and are finite.
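Both identities can be verified exactly for power functions, using the closed form $E[U^k] = 2^k\,\Gamma(n/2+k)/\Gamma(n/2)$ for $k > -n/2$. The sketch below takes $g(u) = u^k$ in Lemma 2.2 and $h(z) = z^k$ in Corollary 2.2.1; the helper names are hypothetical and the restriction to power functions is an assumption of the illustration.

```python
import math


def chi2_moment(n, k):
    """E[U**k] for U ~ chi-square with n degrees of freedom; requires k > -n/2."""
    return 2.0 ** k * math.exp(math.lgamma(n / 2.0 + k) - math.lgamma(n / 2.0))


def scaled_moment(n, c, k):
    """E[Z**k] for Z = c*U/(n-2)."""
    return (c / (n - 2.0)) ** k * chi2_moment(n, k)


def lemma_2_2_sides(n, k):
    """Both sides of Lemma 2.2 for g(u) = u**k, so that u*g'(u) = k*u**k."""
    lhs = chi2_moment(n, k + 1.0)
    rhs = n * chi2_moment(n, k) + 2.0 * k * chi2_moment(n, k)
    return lhs, rhs


def corollary_sides(n, c, k):
    """Both sides of Corollary 2.2.1 for h(z) = z**k, so that z*h'(z) = k*z**k."""
    lhs = (n - 2.0) / c * scaled_moment(n, c, k + 1.0)
    rhs = n * scaled_moment(n, c, k) + 2.0 * k * scaled_moment(n, c, k)
    return lhs, rhs
```

The case $k = -1$ recovers the familiar fact $E[1/U] = 1/(n-2)$ for $n > 2$.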

Lemma 2.3. [Lehmann (1966)]. Let $S$ be any random variable, and let $p_1(S)$ and $p_2(S)$ map the real line into itself. If $p_1(S)$ and $p_2(S)$ are either both nonincreasing in $S$ or both nondecreasing in $S$, then

$E_S[p_1(S)p_2(S)] \ge E_S[p_1(S)]E_S[p_2(S)].$
Note that Lemmas 2.1 and 2.2 may be proved by integration by parts and the corollary by the indicated change of variable. Lemma 2.3 follows from a new concept of dependency, namely positive quadrant dependence, introduced by Lehmann (1966). The above results will be employed in the proof of the main theorem in the next section.
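The inequality of Lemma 2.3 can be illustrated on a small discrete distribution. In the sketch below, $S$ is uniform on $\{0, 1, 2, 3\}$ and the monotone functions are arbitrary choices made for this example; the quantity computed is $E[p_1 p_2] - E[p_1]E[p_2]$.

```python
def expectation(values, probs, f):
    """E[f(S)] for a discrete S taking values[i] with probability probs[i]."""
    return sum(p * f(s) for s, p in zip(values, probs))


def covariance_gap(values, probs, f1, f2):
    """E[f1(S) f2(S)] - E[f1(S)] E[f2(S)]."""
    joint = expectation(values, probs, lambda s: f1(s) * f2(s))
    return joint - expectation(values, probs, f1) * expectation(values, probs, f2)


values = [0.0, 1.0, 2.0, 3.0]
probs = [0.25, 0.25, 0.25, 0.25]

# Both nondecreasing: the gap is nonnegative.
gap_both_up = covariance_gap(values, probs, lambda s: s * s, lambda s: min(s, 2.0))
# Both nonincreasing: the gap is again nonnegative.
gap_both_down = covariance_gap(values, probs, lambda s: -s, lambda s: 1.0 / (1.0 + s))
# Opposite monotonicity: the lemma does not apply and the gap is negative here.
gap_opposite = covariance_gap(values, probs, lambda s: s, lambda s: 1.0 / (1.0 + s))
```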
3. Main Result. Let $X$ be a $p$-variate ($p \ge 3$) random vector normally distributed with unknown mean $\mu$ and unknown covariance matrix $\Sigma = \mathrm{diag}(\sigma_1^2, \ldots, \sigma_p^2)$. Assume that $(n_i-2)w_i/\sigma_i^2 \sim \chi^2_{n_i}$ ($n_i > 2$), $i = 1, \ldots, p$, where $w_i$ and $w_j$ ($i \ne j = 1, \ldots, p$) are mutually independent and are independent of $X$. As in Berger and Bock (1976), define $W = \mathrm{diag}(w_1, \ldots, w_p)$, $T = \min_{1 \le i \le p}(\chi^2_{n_i}/n_i)$, and $\tau = \tau(n_1, \ldots, n_p) = E(T^{-1})$. In the following theorem we will obtain a class of minimax estimators for $\mu$ relative to the loss function given by (1.1).
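The constant $\tau = E(T^{-1})$ depends only on the degrees of freedom $n_1, \ldots, n_p$. The sketch below approximates it by plain Monte Carlo, which is a substitute chosen for this illustration and not a method taken from Berger and Bock (1976); the function name, the number of draws, and the seed are likewise assumptions.

```python
import random


def estimate_tau(dfs, draws=100_000, seed=0):
    """Monte Carlo estimate of tau = E[1/T], T = min_i chi2_{n_i}/n_i, for dfs = [n_1, ..., n_p]."""
    rng = random.Random(seed)
    total = 0.0
    for _ in range(draws):
        # A chi-square variate with n d.f. is Gamma(shape=n/2, scale=2).
        t = min(rng.gammavariate(n / 2.0, 2.0) / n for n in dfs)
        total += 1.0 / t
    return total / draws
```

For $p = 1$ the exact value is $n_1/(n_1-2)$, which gives a simple check of the simulation; for example `estimate_tau([10])` should be close to 1.25.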


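Theorem. The estimator

(3.1)  $\delta(X, W) = [I_p - r(X, W)\|X\|_W^{-2}Q^{-1}W^{-1}]X$

is minimax for $\mu$ relative to the loss function (1.1) with $Q = \mathrm{diag}(q_1, \ldots, q_p)$, where $\|X\|_W^2 = X'W^{-1}Q^{-1}W^{-1}X$, provided that the following conditions are satisfied:

(i)  $0 \le r(X, W) \le \dfrac{2\{\alpha(p-2\tau) + (1-\alpha)[\mathrm{tr}(\Sigma^{-1}Q^{-1}) - 2\tau\,\mathrm{ch}_{\max}(\Sigma^{-1}Q^{-1})]\}}{\alpha + (1-\alpha)\,\mathrm{ch}_{\max}(\Sigma^{-1}Q^{-1})}$

with $\alpha(p-2\tau) + (1-\alpha)[\mathrm{tr}(\Sigma^{-1}Q^{-1}) - 2\tau\,\mathrm{ch}_{\max}(\Sigma^{-1}Q^{-1})] \ge 0$ and $0 \le \alpha \le 1$,

(ii)  $r(X, W)$ is nondecreasing in $|X_i|$, $i = 1, \ldots, p$,

(iii)  $r(X, W)$ is nonincreasing in $w_i$, $i = 1, \ldots, p$,

(iv)  $r(X, W)\|X\|_W^{-2}$ is nondecreasing in $w_i$, $i = 1, \ldots, p$.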
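To make the estimator (3.1) and the bound in Condition (i) concrete, the following sketch evaluates both for diagonal $W$, $Q$ and $\Sigma$. It assumes a constant $r$, which satisfies Conditions (ii) to (iv) because $\|X\|_W^{-2}$ is itself nondecreasing in each $w_i$. The function names are hypothetical, and `r_upper_bound` takes the unknown $\sigma_i^2$ as inputs purely for illustration.

```python
def norm_w_sq(x, w, q):
    """||X||_W^2 = X' W^{-1} Q^{-1} W^{-1} X for diagonal W and Q."""
    return sum(xi * xi / (qi * wi * wi) for xi, wi, qi in zip(x, w, q))


def shrinkage_estimate(x, w, q, r):
    """Estimator (3.1) with a constant r; undefined when x is the zero vector."""
    nsq = norm_w_sq(x, w, q)
    return [xi * (1.0 - r / (nsq * qi * wi)) for xi, wi, qi in zip(x, w, q)]


def r_upper_bound(alpha, tau, sigma2, q):
    """Right-hand side of Condition (i) for Sigma = diag(sigma2), Q = diag(q)."""
    p = len(q)
    roots = [1.0 / (s * qi) for s, qi in zip(sigma2, q)]  # diagonal of Sigma^{-1} Q^{-1}
    trace, ch_max = sum(roots), max(roots)
    numerator = 2.0 * (alpha * (p - 2.0 * tau)
                       + (1.0 - alpha) * (trace - 2.0 * tau * ch_max))
    return numerator / (alpha + (1.0 - alpha) * ch_max)
```

For $\alpha = 1$ the bound reduces to $2(p - 2\tau)$ and no longer involves $\Sigma$.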


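Proof. Write $\delta = \delta(X, W)$ and $r = r(X, W)$. Let $\Delta = R(X, \mu) - R(\delta, \mu)$. Then

(3.2)  $C\Delta = E_{W,X}\{2r\|X\|_W^{-2}(X-\mu)'[\alpha Q + (1-\alpha)\Sigma^{-1}]Q^{-1}W^{-1}X$
       $\qquad - r^2\|X\|_W^{-4}X'W^{-1}Q^{-1}[\alpha Q + (1-\alpha)\Sigma^{-1}]Q^{-1}W^{-1}X\}$
       $= 2[\alpha\Delta_1 + (1-\alpha)\Delta_2]$, say,

where

$\Delta_1 = E_{W,X}[r\|X\|_W^{-2}(X-\mu)'W^{-1}X - r^2\|X\|_W^{-2}/2]$

and

$\Delta_2 = E_{W,X}[r\|X\|_W^{-2}(X-\mu)'\Sigma^{-1}Q^{-1}W^{-1}X - \tfrac{1}{2}r^2\|X\|_W^{-4}X'W^{-1}Q^{-1}\Sigma^{-1}Q^{-1}W^{-1}X].$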
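The decomposition (3.2) holds pointwise, before any expectation is taken, and this can be checked numerically. The sketch below compares, for diagonal matrices, the direct loss difference $(X-\mu)'M(X-\mu) - (\delta-\mu)'M(\delta-\mu)$ with $M = \alpha Q + (1-\alpha)\Sigma^{-1}$, its expanded form, and the split $2[\alpha d_1 + (1-\alpha)d_2]$, where $d_1$ and $d_2$ denote the integrands of $\Delta_1$ and $\Delta_2$. The function name and the sample inputs are assumptions of this illustration.

```python
def risk_difference_integrand(x, mu, w, sigma2, q, alpha, r):
    """Return three algebraically equal forms of the integrand of C*Delta in (3.2)."""
    m = [alpha * qi + (1.0 - alpha) / s for qi, s in zip(q, sigma2)]
    nsq = sum(xi * xi / (qi * wi * wi) for xi, wi, qi in zip(x, w, q))
    # gamma = r ||X||_W^{-2} Q^{-1} W^{-1} X, so that delta = X - gamma.
    gamma = [r * xi / (nsq * qi * wi) for xi, wi, qi in zip(x, w, q)]
    delta = [xi - gi for xi, gi in zip(x, gamma)]

    direct = sum(mi * ((xi - ui) ** 2 - (di - ui) ** 2)
                 for mi, xi, di, ui in zip(m, x, delta, mu))
    expanded = sum(2.0 * gi * mi * (xi - ui) - mi * gi * gi
                   for gi, mi, xi, ui in zip(gamma, m, x, mu))

    d1 = (r / nsq) * sum((xi - ui) * xi / wi for xi, ui, wi in zip(x, mu, w)) \
        - r * r / (2.0 * nsq)
    d2 = (r / nsq) * sum((xi - ui) * xi / (s * qi * wi)
                         for xi, ui, wi, s, qi in zip(x, mu, w, sigma2, q)) \
        - 0.5 * r * r / nsq ** 2 * sum(xi * xi / (s * qi * qi * wi * wi)
                                       for xi, wi, s, qi in zip(x, w, sigma2, q))
    split = 2.0 * (alpha * d1 + (1.0 - alpha) * d2)
    return direct, expanded, split
```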

In the following we will find appropriate lower bounds for $\Delta_1$ and $\Delta_2$ which, in conjunction with Condition (i), will establish that $\Delta \ge 0$. But a lower bound for $\Delta_1$ has been obtained by Berger and Bock (1976, Eq. (2.10)), namely

(3.3)  $\Delta_1 \ge E_X\{E_W(r\|X\|_W^{-2})[p - 2\tau - E_W(r/2)]\}.$

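We will proceed to find a lower bound for $\Delta_2$. Let

(3.4)  $\Delta_2 = \Delta_{21} + \Delta_{22},$

where

(3.5)  $\Delta_{21} = E_{W,X}[r\|X\|_W^{-2}(X-\mu)'\Sigma^{-1}Q^{-1}W^{-1}X]$

and

(3.6)  $\Delta_{22} = E_{W,X}[-(1/2)r^2\|X\|_W^{-4}X'W^{-1}Q^{-1}\Sigma^{-1}Q^{-1}W^{-1}X].$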


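Then, taking expectation with respect to $X$ first and then $W$, and by an application of Lemma 2.1, we have

(3.7)  $\Delta_{21} = E_{W,X}\Big[\sum_{i=1}^{p} \dfrac{1}{\sigma_i^2 q_i w_i}(X_i-\mu_i)\,r\|X\|_W^{-2}X_i\Big]$

       $= E_W E_X\Big[\sum_{i=1}^{p} \dfrac{1}{q_i w_i}\dfrac{\partial}{\partial X_i}\big(r\|X\|_W^{-2}X_i\big)\Big]$

       $= E_{W,X}\sum_{i=1}^{p}\Big[\dfrac{r}{\|X\|_W^2 q_i w_i} - \dfrac{2rX_i^2}{\|X\|_W^4 q_i^2 w_i^3} + \dfrac{X_i}{\|X\|_W^2 q_i w_i}\dfrac{\partial r}{\partial X_i}\Big]$

       $\ge E_{W,X}\Big[\dfrac{r}{\|X\|_W^2}\sum_{i=1}^{p}\dfrac{1}{q_i w_i} - \dfrac{2r}{\|X\|_W^4}\sum_{i=1}^{p}\dfrac{X_i^2}{q_i^2 w_i^3}\Big].$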


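The last inequality follows from Condition (ii) since $X_i(\partial r/\partial X_i) \ge 0$ for all $i = 1, \ldots, p$. The first term on the right-hand side of (3.7) may be further evaluated by taking the expectation first with respect to $W$ and then with respect to $X$. Recall that $(n_i-2)w_i/\sigma_i^2 \sim \chi^2_{n_i}$. Then it follows from Corollary 2.2.1 with $h(w_i) = r/(w_i\|X\|_W^2)$ that, for each $i = 1, \ldots, p$,

$\dfrac{n_i-2}{\sigma_i^2}E_{w_i}\Big[\dfrac{r}{\|X\|_W^2}\Big] = n_i E_{w_i}\Big[\dfrac{r}{w_i\|X\|_W^2}\Big] + 2E_{w_i}\Big[\dfrac{1}{\|X\|_W^2}\dfrac{\partial r}{\partial w_i} - \dfrac{r}{w_i\|X\|_W^2} + \dfrac{2rX_i^2}{\|X\|_W^4 q_i w_i^3}\Big].$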


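Therefore,

(3.8)  $E_W\Big[\sum_{i=1}^{p}\dfrac{r}{\|X\|_W^2 q_i w_i}\Big] = E_W\sum_{i=1}^{p}\Big[\dfrac{r}{\|X\|_W^2\sigma_i^2 q_i} - \dfrac{4rX_i^2}{(n_i-2)\|X\|_W^4 q_i^2 w_i^3} - \dfrac{2}{(n_i-2)\|X\|_W^2 q_i}\dfrac{\partial r}{\partial w_i}\Big]$

       $\ge E_W\sum_{i=1}^{p}\Big[\dfrac{r}{\|X\|_W^2\sigma_i^2 q_i} - \dfrac{4rX_i^2}{(n_i-2)\|X\|_W^4 q_i^2 w_i^3}\Big].$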


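The last inequality follows by Condition (iii). Now, substituting (3.8) into the last expression of (3.7), the two sums in $X_i^2/(q_i^2 w_i^3)$ combine with coefficient $2n_i/(n_i-2)$. Since $n_i/[(n_i-2)w_i] \le T^{-1}/\sigma_i^2$ by the definition of $T$, and $\sum_{i=1}^{p} X_i^2/(\sigma_i^2 q_i^2 w_i^2) \le \mathrm{ch}_{\max}(\Sigma^{-1}Q^{-1})\|X\|_W^2$, we have

(3.9)  $\Delta_{21} \ge E_{W,X}\{(r\|X\|_W^{-2})[\mathrm{tr}(\Sigma^{-1}Q^{-1}) - 2T^{-1}\mathrm{ch}_{\max}(\Sigma^{-1}Q^{-1})]\}.$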


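Substituting (3.9) and (3.6) into (3.4), we obtain

(3.10)  $\Delta_2 \ge E_{W,X}\{(r\|X\|_W^{-2})[\mathrm{tr}(\Sigma^{-1}Q^{-1}) - 2T^{-1}\mathrm{ch}_{\max}(\Sigma^{-1}Q^{-1}) - (r/2)\mathrm{ch}_{\max}(\Sigma^{-1}Q^{-1})]\}$

        $\ge E_X\{E_W(r\|X\|_W^{-2})[\mathrm{tr}(\Sigma^{-1}Q^{-1}) - 2\tau\,\mathrm{ch}_{\max}(\Sigma^{-1}Q^{-1}) - E_W(r/2)\mathrm{ch}_{\max}(\Sigma^{-1}Q^{-1})]\}.$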

The last inequality in (3.10) is obtained by an application of Lemma 2.3, since $r\|X\|_W^{-2}$ is nondecreasing in $w_i$ and $[\mathrm{tr}(\Sigma^{-1}Q^{-1}) - 2T^{-1}\mathrm{ch}_{\max}(\Sigma^{-1}Q^{-1}) - (r/2)\mathrm{ch}_{\max}(\Sigma^{-1}Q^{-1})]$ is also nondecreasing in $w_i$ for $i = 1, \ldots, p$, and since $w_i$ and $w_j$ ($i \ne j$) are mutually independent.
Hence, upon the substitution of (3.10) and (3.3) into (3.2), we conclude that

(3.11)  $C\Delta/2 = \alpha\Delta_1 + (1-\alpha)\Delta_2$

        $\ge E_X\{E_W(r\|X\|_W^{-2})$
        $\quad \times \{\alpha(p-2\tau) + (1-\alpha)[\mathrm{tr}(\Sigma^{-1}Q^{-1}) - 2\tau\,\mathrm{ch}_{\max}(\Sigma^{-1}Q^{-1})]$
        $\quad - [\alpha + (1-\alpha)\mathrm{ch}_{\max}(\Sigma^{-1}Q^{-1})]E_W(r/2)\}\}.$

By Condition (i), it is clear that the last expression of (3.11) is nonnegative for all $\mu$ and $\Sigma$, and hence $\delta$ is minimax, proving the theorem.

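Remarks. (1) Unless Condition (i) is satisfied, the theorem is vacuous. Thus it is required not only that $\mathrm{tr}(\Sigma^{-1}Q^{-1})$ and $\mathrm{ch}_{\max}(\Sigma^{-1}Q^{-1})$ be known but that, for $0 \le \alpha < 1$,

(3.12)  $\alpha(p-2\tau) + (1-\alpha)[\mathrm{tr}(\Sigma^{-1}Q^{-1}) - 2\tau\,\mathrm{ch}_{\max}(\Sigma^{-1}Q^{-1})] \ge 0.$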

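On the other hand, if there exist positive constants $c_1$ and $c_2$ ($0 < c_1 \le c_2 < \infty$) such that (1.4) holds or, equivalently,

$\mathrm{ch}_{\min}(\Sigma^{-1}) \ge c_1, \quad \mathrm{ch}_{\max}(\Sigma^{-1}) \le c_2,$

and

$p > 2\tau[\alpha + (1-\alpha)c_2\,\mathrm{ch}_{\max}(Q^{-1})]/[\alpha + (1-\alpha)c_1\,\mathrm{ch}_{\min}(Q^{-1})],$

then the theorem remains true with Condition (i) replaced by

(i')  $0 \le r(X, W) \le \dfrac{2\{\alpha(p-2\tau) + (1-\alpha)[c_1\mathrm{tr}(Q^{-1}) - 2\tau c_2\,\mathrm{ch}_{\max}(Q^{-1})]\}}{\alpha + (1-\alpha)c_2\,\mathrm{ch}_{\max}(Q^{-1})}.$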
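The practical point of (i') is that it needs only the bounds $c_1$ and $c_2$, not $\Sigma$ itself. The sketch below computes the right-hand sides of (i) and (i') for diagonal matrices so the two can be compared; the function names and the numerical inputs mentioned afterwards are illustrative assumptions.

```python
def bound_i(alpha, tau, sigma2, q):
    """Right-hand side of Condition (i), which requires Sigma = diag(sigma2)."""
    p = len(q)
    roots = [1.0 / (s * qi) for s, qi in zip(sigma2, q)]
    trace, ch_max = sum(roots), max(roots)
    numerator = 2.0 * (alpha * (p - 2.0 * tau)
                       + (1.0 - alpha) * (trace - 2.0 * tau * ch_max))
    return numerator / (alpha + (1.0 - alpha) * ch_max)


def bound_i_prime(alpha, tau, c1, c2, q):
    """Right-hand side of Condition (i'), which uses only the bounds c1 and c2 of (1.4)."""
    p = len(q)
    inv_q = [1.0 / qi for qi in q]
    numerator = 2.0 * (alpha * (p - 2.0 * tau)
                       + (1.0 - alpha) * (c1 * sum(inv_q) - 2.0 * tau * c2 * max(inv_q)))
    return numerator / (alpha + (1.0 - alpha) * c2 * max(inv_q))
```

Whenever (1.4) holds and the numerator of (i') is nonnegative, the bound in (i') cannot exceed the bound in (i), because $\mathrm{tr}(\Sigma^{-1}Q^{-1}) \ge c_1\mathrm{tr}(Q^{-1})$ and $\mathrm{ch}_{\max}(\Sigma^{-1}Q^{-1}) \le c_2\,\mathrm{ch}_{\max}(Q^{-1})$. For instance, with $p = 6$, $Q = I$, $\tau = 1.2$, $\alpha = 0.5$, $c_1 = 0.8$ and $c_2 = 1.25$, the bound in (i') equals 4.8.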

(2) Berger and Bock (1976) show that the estimator $\delta(X, W)$ given by (3.1) is minimax for $\mu$ under the loss $L_1(\delta; \mu, \Sigma)$ and that there exists a large positive integer $N$ such that $p - 2\tau > 0$ for all $n_i \ge N$ and $p \ge 3$. Furthermore, they provide a formula for calculating $\tau$ when the $n_i$'s are even. In this paper, we employ their method of proof to establish our result. It should be noted that our result reduces to theirs when $\alpha = 1$.
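A small simulation can illustrate the theorem, though it proves nothing. The sketch below estimates the risks of $X$ and of $\delta(X, W)$ under loss (1.1) by Monte Carlo for one configuration chosen for this example: $p = 5$, $\mu = 0$, $\Sigma = Q = I$, $n_i = 20$, $\alpha = 0.5$ and constant $r = 1$. All of these settings, and the function name, are assumptions of the illustration.

```python
import math
import random


def simulate_risks(mu, sigma2, q, dfs, alpha, r, draws=20000, seed=1):
    """Monte Carlo risks of X and of the estimator (3.1) with constant r under loss (1.1)."""
    rng = random.Random(seed)
    p = len(mu)
    c = alpha * sum(qi * s for qi, s in zip(q, sigma2)) + (1.0 - alpha) * p
    m = [alpha * qi + (1.0 - alpha) / s for qi, s in zip(q, sigma2)]
    total_x = total_d = 0.0
    for _ in range(draws):
        x = [rng.gauss(ui, math.sqrt(s)) for ui, s in zip(mu, sigma2)]
        # w_i = sigma_i^2 * chi2_{n_i} / (n_i - 2), with chi2_n = Gamma(n/2, scale 2).
        w = [s * rng.gammavariate(n / 2.0, 2.0) / (n - 2) for s, n in zip(sigma2, dfs)]
        nsq = sum(xi * xi / (qi * wi * wi) for xi, wi, qi in zip(x, w, q))
        d = [xi * (1.0 - r / (nsq * qi * wi)) for xi, wi, qi in zip(x, w, q)]
        total_x += sum(k * (xi - ui) ** 2 for k, xi, ui in zip(m, x, mu)) / c
        total_d += sum(k * (di - ui) ** 2 for k, di, ui in zip(m, d, mu)) / c
    return total_x / draws, total_d / draws


# With Sigma = Q = I, Condition (i) reads 0 <= r <= 2(p - 2*tau). A crude moment
# bound gives tau <= (5 * E[(20/chi2_20)^4]) ** 0.25, roughly 2.02, so r = 1 is admissible.
risk_x, risk_delta = simulate_risks(
    mu=[0.0] * 5, sigma2=[1.0] * 5, q=[1.0] * 5, dfs=[20] * 5, alpha=0.5, r=1.0)
```

In this configuration the simulated risk of $X$ should be close to 1 and that of $\delta$ noticeably smaller; the gain is expected to shrink as $\mu$ moves away from the origin.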

